Abstract: A Markov chain is geometrically ergodic if it converges to its invariant distribution at a geometric rate in total variation norm. We study geometric ergodicity of deterministic and random scan versions of the two-variable Gibbs sampler. We give a sufficient condition which simultaneously guarantees both versions are geometrically ergodic. We also develop a method for simultaneously establishing that both versions are subgeometrically ergodic. These general results allow us to characterize the convergence rate of two-variable Gibbs samplers in a particular family of discrete bivariate distributions.



1. Introduction 

Let $\varpi$ be a probability distribution having support $\mathsf{X} \times \mathsf{Y} \subseteq \mathbb{R}^k \times \mathbb{R}^l$, $k, l \ge 1$, and let $\varpi_{X|Y}$ and $\varpi_{Y|X}$ denote the associated conditional distributions. We assume it is possible to simulate directly from $\varpi_{X|Y}$ and $\varpi_{Y|X}$. Then there are two Markov chains having $\varpi$ as their invariant distribution, each of which could be called a two-variable Gibbs sampler (TGS). The most common version of a TGS is the deterministic scan Gibbs sampler (DGS), which is now described. Suppose the current state of the chain is $(X_n, Y_n) = (x, y)$; then the next state, $(X_{n+1}, Y_{n+1})$, is obtained as follows.

Iteration n + 1 of DGS: 

1. Draw $X_{n+1} \sim \varpi_{X|Y}(\cdot \mid y)$, and call the observed value $x'$.

2. Draw $Y_{n+1} \sim \varpi_{Y|X}(\cdot \mid x')$.



An alternative TGS is the random scan Gibbs sampler (RGS). Fix $p \in (0,1)$ and suppose the current state of the RGS chain is $(X_n, Y_n) = (x, y)$. Then the next state, $(X_{n+1}, Y_{n+1})$, is obtained as follows.

Iteration n + 1 of RGS:

1. Draw $B \sim \text{Bernoulli}(p)$.

2. If $B = 1$, then draw $X_{n+1} \sim \varpi_{X|Y}(\cdot \mid y)$ and set $Y_{n+1} = y$.

3. If $B = 0$, then draw $Y_{n+1} \sim \varpi_{Y|X}(\cdot \mid x)$ and set $X_{n+1} = x$.
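
The two update rules can be sketched in a few lines of Python. This is only a minimal illustration: the callables `draw_x_given_y` and `draw_y_given_x` are hypothetical names (not from the paper) standing for whatever routines simulate from the two conditional distributions.

```python
import random


def dgs_step(x, y, draw_x_given_y, draw_y_given_x):
    """One deterministic-scan update: refresh X given y, then Y given the new x."""
    x_new = draw_x_given_y(y)
    y_new = draw_y_given_x(x_new)
    return x_new, y_new


def rgs_step(x, y, draw_x_given_y, draw_y_given_x, p, rng=random):
    """One random-scan update: with probability p refresh X, otherwise refresh Y."""
    if rng.random() < p:           # B = 1: update X, keep y
        return draw_x_given_y(y), y
    return x, draw_y_given_x(x)    # B = 0: update Y, keep x
```

Note that the current value of x is never used by the deterministic scan update, whereas the random scan update always carries one coordinate over unchanged.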



Despite the simple structure of either TGS, these algorithms are widely applicable in the posterior analysis of complex Bayesian models. A TGS also arises naturally when $\varpi$ is created via data augmentation techniques (Hobert, 2011; Tanner and Wong, 1987).

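Inference based on $\varpi$ often requires calculation of an intractable expectation. Let $g : \mathsf{X} \times \mathsf{Y} \to \mathbb{R}$ and let $E_\varpi g$ denote the expectation of $g$ with respect to $\varpi$. If a TGS Markov chain is ergodic (see Tierney, 1994) and $E_\varpi |g| < \infty$, then

$$\bar{g}_n := \frac{1}{n} \sum_{i=0}^{n-1} g(X_i, Y_i) \xrightarrow{\text{a.s.}} E_\varpi g \quad \text{as } n \to \infty .$$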

Thus estimation of $E_\varpi g$ is simple. However, the estimator $\bar{g}_n$ will be more valuable if we can attach an estimate of the unknown Monte Carlo error $\bar{g}_n - E_\varpi g$. An approximation to the sampling distribution of the Monte Carlo error is available when a Markov chain central limit theorem (CLT) holds:

$$\sqrt{n}\,(\bar{g}_n - E_\varpi g) \xrightarrow{d} N(0, \sigma_g^2) \quad \text{as } n \to \infty$$

with $0 < \sigma_g^2 < \infty$. The variance $\sigma_g^2$ accounts for the serial dependence in a TGS Markov chain and consistent estimation of it requires specialized techniques such as batch means, spectral methods or regenerative simulation. Let $\hat{\sigma}_n$ be an estimator of $\sigma_g$. If, with probability 1, $\hat{\sigma}_n \to \sigma_g$ as $n \to \infty$, then an asymptotically valid Monte Carlo standard error is $\hat{\sigma}_n / \sqrt{n}$. These tools allow the practitioner to use the results of a TGS simulation with the same level of confidence that one would have if the observations were a random sample from $\varpi$. For more on this approach the interested reader can consult Geyer (1992), Geyer (2011), Flegal, Haran and Jones (2008), Flegal and Jones (2010), Flegal and Jones (2011), Hobert et al. (2002), Jones et al. (2006), and Jones and Hobert (2001).
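
To make the idea of a Monte Carlo standard error concrete, here is a minimal sketch of the non-overlapping batch means estimator. The function name `batch_means_se` and the default of roughly $\sqrt{n}$ batches are assumptions made for illustration; the references above discuss batch-size choices and the conditions under which the estimator is consistent.

```python
import math


def batch_means_se(values, n_batches=None):
    """Monte Carlo standard error of the sample mean via non-overlapping batch means.

    `values` holds g(X_i, Y_i) along the simulated chain.
    """
    n = len(values)
    if n_batches is None:
        n_batches = int(math.sqrt(n))   # illustrative default: about sqrt(n) batches
    b = n // n_batches                  # batch length; leftover draws at the end are dropped
    used = values[: b * n_batches]
    overall = sum(used) / len(used)
    batch_avgs = [sum(used[k * b:(k + 1) * b]) / b for k in range(n_batches)]
    # b times the sample variance of the batch averages estimates sigma_g^2
    sigma2_hat = b * sum((m - overall) ** 2 for m in batch_avgs) / (n_batches - 1)
    return math.sqrt(sigma2_hat / len(used))
```

The returned value plays the role of $\hat{\sigma}_n / \sqrt{n}$ in the text; whether it is trustworthy depends on the mixing conditions discussed next.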

The CLT holds if $E_\varpi |g|^{2+\epsilon} < \infty$ for some $\epsilon > 0$ and the Markov chain is rapidly mixing (Chan and Geyer, 1994). In particular, we require that the Markov chain be geometrically ergodic; that is, that it converge to the target $\varpi$ in total variation norm at a geometric rate. Under these same conditions methods such as batch means and regenerative simulation provide strongly consistent estimators of $\sigma_g^2$. Thus establishing geometric ergodicity is a key step in ensuring the reliability of a TGS as a method for estimating features of $\varpi$.

The convergence rate of DGS Markov chains has received substantial attention. In particular, sufficient conditions for geometric ergodicity have been developed for several DGS chains for practically relevant statistical models (see e.g. Hobert and Geyer, 1998; Johnson and Jones, 2010; Jones and Hobert, 2004; Marchev and Hobert, 2004; Roberts and Polson, 1994; Roberts and Rosenthal, 1999; Roman, 2012; Roman and Hobert, 2011; Rosenthal, 1996; Roy and Hobert, 2007; Tan and Hobert, 2009). The convergence rate of RGS chains has received almost no attention, even though RGS chains are sometimes useful. Liu, Wong and Kong (1995) did investigate geometric convergence of RGS chains, but the required regularity conditions are daunting and, to our knowledge, have not been applied to practically relevant statistical models. Recently Johnson, Jones and Neath (2011) gave conditions which simultaneously establish geometric ergodicity of both the DGS chain and the corresponding RGS chain. These authors also conjectured that if the RGS chain is geometrically ergodic, then so is the DGS chain. That is to say, the qualitative convergence properties of TGS chains coincide. We are not able to resolve this conjecture in general, but in our main application (see Section 5) this is indeed the case.

A TGS chain which converges subgeometrically (i.e., slower than geometric) would not be as useful as another chain which is geometrically ergodic, although with additional moment conditions it is still possible for a CLT to hold (Jones, 2004).
Thus it would be useful to have criteria to check for subgeometric convergence. We 
are unaware of any previous work investigating subgeometric convergence of TGS 
Markov chains. 

In the rest of this paper, we extend the results of Johnson, Jones and Neath 
(2011) and provide a condition which can be used to simultaneously establish ge- 
ometric ergodicity of DGS and RGS Markov chains. We then turn our attention 
to development of a condition which ensures that both the DGS and RGS chains 
converge subgeometrically. Finally, we apply our results to a class of bivariate dis- 
tributions where we are able to characterize the convergence properties of the DGS 
and RGS chains. But we begin with some Markov chain background and a formal 
definition of the Markov chains we study. 

2. Background and Notation 

Let $\mathsf{Z}$ be a topological space and $\mathcal{B}(\mathsf{Z})$ denote its Borel $\sigma$-algebra. Also, let $\Phi = \{Z_0, Z_1, Z_2, \ldots\}$ be a Markov chain having Markov transition kernel $P$. That is, $P : \mathsf{Z} \times \mathcal{B}(\mathsf{Z}) \to [0,1]$ such that for each $A \in \mathcal{B}(\mathsf{Z})$, $P(\cdot, A)$ is a nonnegative measurable function and for each $z \in \mathsf{Z}$, $P(z, \cdot)$ is a probability measure. As usual, $P$ acts to the left on measures so that if $\nu$ is a measure on $(\mathsf{Z}, \mathcal{B}(\mathsf{Z}))$ and $A \in \mathcal{B}(\mathsf{Z})$, then

$$\nu P(A) = \int_{\mathsf{Z}} \nu(dz) P(z, A) .$$

For any $n \in \mathbb{Z}^+$, the $n$-step Markov transition kernel is given by $P^n(z, A) = \Pr(Z_{n+j} \in A \mid Z_j = z)$.

Let $\varpi$ be an invariant probability measure for $P$, that is, $\varpi P = \varpi$. Denote total variation norm by $\| \cdot \|_{TV}$. If $\Phi$ is ergodic, then for all $z \in \mathsf{Z}$ we have $\|P^n(z, \cdot) - \varpi(\cdot)\|_{TV} \to 0$ as $n \to \infty$. Our goal is to study the rate of this convergence. Suppose there exist a real-valued function $M(z)$ on $\mathsf{Z}$ and $0 < t < 1$ such that for all $z$

$$\|P^n(z, \cdot) - \varpi(\cdot)\|_{TV} \le M(z)\, t^n . \tag{1}$$

Then $\Phi$ is geometrically ergodic, otherwise it is subgeometrically ergodic.

2.1. Two-variable Gibbs samplers

In this section we define the Markov kernels associated with the DGS and RGS chains described in Section 1. We also introduce a third Markov chain which will prove crucial to our study of the other Markov chains.

Recall that $\varpi$ is a probability distribution having support $\mathsf{X} \times \mathsf{Y} \subseteq \mathbb{R}^k \times \mathbb{R}^l$, $k, l \ge 1$. Let $\pi(x, y)$ be a density of $\varpi$ with respect to a measure $\mu = \mu_X \times \mu_Y$. Then the marginal densities are given by

$$\pi_X(x) = \int_{\mathsf{Y}} \pi(x, y)\, \mu_Y(dy)$$

and similarly for $\pi_Y(y)$. The conditional densities are $\pi_{X|Y}(x|y) = \pi(x,y)/\pi_Y(y)$ and $\pi_{Y|X}(y|x) = \pi(x,y)/\pi_X(x)$.

Consider the DGS Markov chain $\Phi_{DGS} = \{(X_0, Y_0), (X_1, Y_1), \ldots\}$ and let

$$k_{DGS}(x', y' | x, y) = \pi_{X|Y}(x'|y)\, \pi_{Y|X}(y'|x') .$$

Then the Markov kernel for $\Phi_{DGS}$ is defined by

$$P_{DGS}((x, y), A) = \int_A k_{DGS}(x', y' | x, y)\, \mu(d(x', y')), \qquad A \in \mathcal{B}(\mathsf{X}) \times \mathcal{B}(\mathsf{Y}) .$$

It is well known that the two marginal sequences comprising $\Phi_{DGS}$ are themselves Markov chains (Liu, Wong and Kong, 1994). We now consider the marginal sequence $\Phi_X = \{X_0, X_1, \ldots\}$ and define

$$k_X(x'|x) = \int_{\mathsf{Y}} \pi_{X|Y}(x'|y)\, \pi_{Y|X}(y|x)\, \mu_Y(dy) .$$

The Markov kernel for $\Phi_X$ is

$$P_X(x, A) = \int_A k_X(x'|x)\, \mu_X(dx'), \qquad A \in \mathcal{B}(\mathsf{X}) .$$

Note that $P_{DGS}$ has $\varpi$ as its invariant distribution while $P_X$ has the marginal $\varpi_X$ as its invariant distribution.

Finally, consider the RGS Markov chain $\Phi_{RGS} = \{(X_0, Y_0), (X_1, Y_1), \ldots\}$. Let $p \in (0, 1)$ and let $\delta$ denote Dirac's delta. Define

$$k_{RGS}(x', y' | x, y) = p\, \pi_{X|Y}(x'|y)\, \delta(y' - y) + (1 - p)\, \pi_{Y|X}(y'|x)\, \delta(x' - x) .$$

Then the Markov kernel for $\Phi_{RGS}$ is

$$P_{RGS}((x, y), A) = \int_A k_{RGS}(x', y' | x, y)\, \mu(d(x', y')) .$$

It is easy to show via direct computation that $\varpi$ is invariant for $P_{RGS}$.
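
For a finite state space the three kernels are just matrices, and the invariance claims can be checked numerically. The sketch below assumes a joint pmf stored as a NumPy array `joint[x, y]` with strictly positive marginals; the function name `gibbs_kernels` and the row-major flattening of the pair $(x, y)$ are illustrative choices, not part of the paper.

```python
import numpy as np


def gibbs_kernels(joint, p=0.5):
    """Return (P_X, P_DGS, P_RGS) for a joint pmf given as a 2-D array joint[x, y]."""
    joint = np.asarray(joint, dtype=float)
    nx, ny = joint.shape
    pi_x = joint.sum(axis=1)
    pi_y = joint.sum(axis=0)
    x_given_y = joint / pi_y             # x_given_y[x, y] = pi_{X|Y}(x | y)
    y_given_x = joint / pi_x[:, None]    # y_given_x[x, y] = pi_{Y|X}(y | x)

    # Marginal chain: k_X(x'|x) = sum_y pi_{X|Y}(x'|y) pi_{Y|X}(y|x)
    P_X = y_given_x @ x_given_y.T

    # Joint chains on states (x, y), flattened as index x * ny + y
    P_DGS = np.zeros((nx * ny, nx * ny))
    P_RGS = np.zeros((nx * ny, nx * ny))
    for x in range(nx):
        for y in range(ny):
            s = x * ny + y
            for x2 in range(nx):
                # random scan, X-move: y is kept
                P_RGS[s, x2 * ny + y] += p * x_given_y[x2, y]
                for y2 in range(ny):
                    # deterministic scan: draw x2 given y, then y2 given x2
                    P_DGS[s, x2 * ny + y2] = x_given_y[x2, y] * y_given_x[x2, y2]
            for y2 in range(ny):
                # random scan, Y-move: x is kept
                P_RGS[s, x * ny + y2] += (1 - p) * y_given_x[x, y2]
    return P_X, P_DGS, P_RGS
```

With `pi = joint.ravel()`, the products `pi @ P_DGS` and `pi @ P_RGS` should both reproduce `pi`, and `joint.sum(axis=1) @ P_X` should reproduce the X-marginal.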

It is well known that $P_X$ and $P_{DGS}$ converge to their respective invariant distributions at the same rate (Diaconis, Khare and Saloff-Coste, 2008; Liu, Wong and Kong, 1994; Robert, 1995; Roberts and Rosenthal, 2001). Thus if one is geometrically ergodic, then so is the other. This relationship has been routinely exploited in the study of TGS chains for practically relevant statistical models (cf. Hobert and Geyer, 1998; Johnson and Jones, 2010; Jones and Hobert, 2004; Roy and Hobert, 2007; Tan and Hobert, 2009) since one of the two chains may be easier to analyze than the other. Recently, Johnson, Jones and Neath (2011) connected the geometric ergodicity of $P_X$ to that of $P_{RGS}$. Thus establishing geometric ergodicity of TGS algorithms often comes down to analyzing $P_X$. This is exactly the approach we take in Sections 3 and 5.

3. Conditions for Geometric Ergodicity

In this section we develop general conditions which ensure that $P_X$, $P_{DGS}$ and $P_{RGS}$ are geometrically ergodic. First we need a couple of concepts from Markov chain theory. Recall the notation from Section 2. That is, $P$ is a Markov kernel on $(\mathsf{Z}, \mathcal{B}(\mathsf{Z}))$. Then $P$ is Feller if for any open set $O \in \mathcal{B}(\mathsf{Z})$, $P(\cdot, O)$ is a lower semicontinuous function. The Markov kernel $P$ acts to the right on functions so that for measurable $f$

$$Pf(z) = \int_{\mathsf{Z}} f(z')\, P(z, dz') .$$

A drift condition holds if there exists a function $U : \mathsf{Z} \to \mathbb{R}^+$, and constants $0 < \lambda < 1$ and $L < \infty$ satisfying

$$PU(z) \le \lambda U(z) + L \quad \text{for all } z \in \mathsf{Z} . \tag{2}$$

Recall that a function $U$ is said to be unbounded off compact sets if the sublevel set $\{z \in \mathsf{Z} : U(z) \le d\}$ is compact for every $d > 0$. If $P$ is Feller, $U$ is unbounded off compact sets and satisfies (2), then $\Phi$ is geometrically ergodic. See Meyn and Tweedie (1993) and Roberts and Rosenthal (2004) for details, while Jones and Hobert (2001) give an introduction to the use of drift conditions.

3.1. Two-variable Gibbs samplers 

Johnson, Jones and Neath (2011) gave a set of conditions which simultaneously prove that $\Phi_X$, $\Phi_{DGS}$ and $\Phi_{RGS}$ are geometrically ergodic. We build on their work and show how a drift condition for $P_X$ naturally provides a drift condition for $P_{RGS}$. This allows us to develop an alternative set of conditions which are sufficient for the geometric ergodicity of $P_X$, $P_{DGS}$ and $P_{RGS}$. The application of this method is illustrated in Section 5.

The following result was essentially proved by Johnson, Jones and Neath (2011), but it was not stated in their paper; see Johnson (2009) for related material. Thus we provide a proof for the sake of completeness. First we set some notation. Suppose $V : \mathsf{X} \to \mathbb{R}^+$ and let

$$G(y) = \int_{\mathsf{X}} V(x)\, \pi_{X|Y}(x|y)\, \mu_X(dx) .$$

Also, for $c > 0$ define

$$W(x, y) = V(x) + c\, G(y) . \tag{3}$$

Lemma 1. Suppose there exist constants $0 < \lambda < 1$ and $L < \infty$ such that for all $x \in \mathsf{X}$

$$P_X V(x) \le \lambda V(x) + L .$$

If $0 < p < 1$ and $p(1-p)^{-1} < c < p[\lambda(1-p)]^{-1}$, then there exists $\lambda < \gamma < 1$ such that

$$P_{RGS} W(x, y) \le \gamma W(x, y) + (1 - p) c L .$$

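Proof. Notice that

$$\begin{aligned}
\int_{\mathsf{Y}} G(y)\, \pi_{Y|X}(y|x)\, \mu_Y(dy)
&= \int_{\mathsf{Y}} \int_{\mathsf{X}} V(x')\, \pi_{X|Y}(x'|y)\, \pi_{Y|X}(y|x)\, \mu_X(dx')\, \mu_Y(dy) \\
&= \int_{\mathsf{X}} V(x') \left[ \int_{\mathsf{Y}} \pi_{X|Y}(x'|y)\, \pi_{Y|X}(y|x)\, \mu_Y(dy) \right] \mu_X(dx') \\
&= \int_{\mathsf{X}} V(x')\, k_X(x'|x)\, \mu_X(dx') \\
&\le \lambda V(x) + L .
\end{aligned}$$

Since

$$\frac{p}{1-p} < c < \frac{p}{\lambda(1-p)} , \tag{4}$$

there exists $\gamma$ such that

$$(1-p)(c\lambda + 1) \vee \frac{p(1+c)}{c} < \gamma < 1 . \tag{5}$$

Then

$$\begin{aligned}
P_{RGS} W(x, y)
&= \int_{\mathsf{X}} \int_{\mathsf{Y}} W(x', y')\, k_{RGS}(x', y'|x, y)\, \mu_X(dx')\, \mu_Y(dy') \\
&= p \int_{\mathsf{X}} \int_{\mathsf{Y}} W(x', y')\, \pi_{X|Y}(x'|y)\, \delta(y' - y)\, \mu_X(dx')\, \mu_Y(dy') \\
&\quad + (1-p) \int_{\mathsf{X}} \int_{\mathsf{Y}} W(x', y')\, \pi_{Y|X}(y'|x)\, \delta(x' - x)\, \mu_X(dx')\, \mu_Y(dy') \\
&= p \int_{\mathsf{X}} W(x', y)\, \pi_{X|Y}(x'|y)\, \mu_X(dx') + (1-p) \int_{\mathsf{Y}} W(x, y')\, \pi_{Y|X}(y'|x)\, \mu_Y(dy') \\
&= p \int_{\mathsf{X}} [V(x') + cG(y)]\, \pi_{X|Y}(x'|y)\, \mu_X(dx') + (1-p) \int_{\mathsf{Y}} [V(x) + cG(y')]\, \pi_{Y|X}(y'|x)\, \mu_Y(dy') \\
&= pcG(y) + (1-p)V(x) + pG(y) + (1-p)c \int_{\mathsf{Y}} G(y')\, \pi_{Y|X}(y'|x)\, \mu_Y(dy') \\
&= p(1+c)G(y) + (1-p)V(x) + (1-p)c \int_{\mathsf{Y}} G(y')\, \pi_{Y|X}(y'|x)\, \mu_Y(dy') \\
&\le (1-p)c\lambda V(x) + (1-p)cL + p(1+c)G(y) + (1-p)V(x) \\
&= (1-p)(c\lambda + 1)V(x) + p(1+c)G(y) + (1-p)cL \\
&\le \gamma W(x, y) + (1-p)cL .
\end{aligned}$$

All that remains is to show that $\gamma > \lambda$. Now

$$\begin{aligned}
\gamma &> (1-p)(c\lambda + 1) && \text{by (5)} \\
&> (1-p)\left( \frac{p\lambda}{1-p} + 1 \right) && \text{by (4)} \\
&= p\lambda + (1-p) \\
&> \lambda && \text{since } \lambda, p \in (0, 1) .
\end{aligned}$$

□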
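
As a quick sanity check on the constants in (4) and (5), consider the purely illustrative values $p = 1/2$ and $\lambda = 1/2$ (these numbers are chosen only for this example and do not come from any particular model):

```latex
\[
\frac{p}{1-p} = 1 < c < \frac{p}{\lambda(1-p)} = 2 ,
\qquad \text{so } c = \tfrac{3}{2} \text{ is admissible,}
\]
\[
(1-p)(c\lambda + 1) = \tfrac{1}{2}\cdot\tfrac{7}{4} = \tfrac{7}{8},
\qquad
\frac{p(1+c)}{c} = \frac{\tfrac{1}{2}\cdot\tfrac{5}{2}}{\tfrac{3}{2}} = \tfrac{5}{6},
\]
\[
\text{so any } \gamma \in \left(\tfrac{7}{8},\, 1\right) \text{ satisfies (5), and indeed }
\gamma > \tfrac{7}{8} > p\lambda + (1-p) = \tfrac{3}{4} > \lambda = \tfrac{1}{2} .
\]
```

In this example the drift rate available for the random scan chain is therefore worse than the rate $\lambda$ assumed for the marginal chain, in line with the conclusion $\gamma > \lambda$ of the lemma.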
The following is an easy consequence of Lemma 1 and the material stated at the beginning of this section.

Proposition 1. Suppose $P_X$ and $P_{RGS}$ are Feller. If there exists a function $V : \mathsf{X} \to \mathbb{R}^+$ such that both $V$ and the corresponding $W$ (as defined at (3)) are unbounded off compact sets, and there exist constants $0 < \lambda < 1$ and $L < \infty$ such that for all $x \in \mathsf{X}$

$$P_X V(x) \le \lambda V(x) + L ,$$

then $\Phi_X$, $\Phi_{DGS}$ and $\Phi_{RGS}$ are geometrically ergodic.

4. Conditions for Subgeometric Convergence 

Our goal in this section is to develop a condition which ensures that $\Phi_X$, $\Phi_{DGS}$ and $\Phi_{RGS}$ converge subgeometrically, but first we need a few concepts from general Markov chain theory. Recall the notation of Section 2. In particular, $P$ is a Markov kernel on $(\mathsf{Z}, \mathcal{B}(\mathsf{Z}))$ having invariant distribution $\varpi$. A Markov kernel defines an operator on the space of measurable functions that are square integrable with respect to the invariant distribution, denoted $L^2(\varpi)$. Also, let

$$L^2_{0,1}(\varpi) = \{ f \in L^2(\varpi) : E_\varpi f = 0 \text{ and } E_\varpi f^2 = 1 \} .$$

For $f, g \in L^2(\varpi)$, define the inner product as

$$\langle f, g \rangle = \int_{\mathsf{Z}} f(z)\, g(z)\, \varpi(dz)$$

and $\|f\|^2 = \langle f, f \rangle$. The norm of the operator $P$ is

$$\|P\| = \sup_{f \in L^2_{0,1}(\varpi)} \|Pf\| .$$

If $P$ is symmetric with respect to $\varpi$, that is, if

$$P(z, dz')\, \varpi(dz) = P(z', dz)\, \varpi(dz') , \tag{6}$$

then $P$ is self-adjoint so that $\langle P h_1, h_2 \rangle = \langle h_1, P h_2 \rangle$. If $P$ is $\varpi$-symmetric, then $\Phi$ is geometrically ergodic if and only if $\|P\| < 1$ (Roberts and Rosenthal, 1997). Moreover, if $Z \sim \varpi$ and $Z' \mid Z = z \sim P(z, \cdot)$, then

$$\|P\| = \sup_{f \in L^2_{0,1}(\varpi)} |\langle Pf, f \rangle| = \sup_{f \in L^2_{0,1}(\varpi)} |E[f(Z') f(Z)]| . \tag{7}$$

The first equality is a property of self-adjoint operators while the second equality follows directly from the definition of inner product.

4.1. Two-variable Gibbs samplers

It is easy to see that $P_X$ is $\varpi_X$-symmetric and $P_{RGS}$ is $\varpi$-symmetric, but $P_{DGS}$ is not $\varpi$-symmetric. Because $P_X$ and $P_{RGS}$ are symmetric, the operator theory described above applies. In particular, if $X \sim \varpi_X$ and $X' \mid X = x \sim P_X(x, \cdot)$, then

$$\|P_X\| = \sup_{f \in L^2_{0,1}(\varpi_X)} |E[f(X') f(X)]| ,$$

while if $(X, Y) \sim \varpi$ and $(X', Y') \mid (X, Y) = (x, y) \sim P_{RGS}((x, y), \cdot)$, then

$$\|P_{RGS}\| = \sup_{f \in L^2_{0,1}(\varpi)} |E[f(X', Y') f(X, Y)]| .$$

Note that despite our use of $\| \cdot \|$ for both operator norms, these are different since they are based on different $L^2$ spaces.

If we can show that $\|P_X\| = \|P_{RGS}\| = 1$, then we will be able to conclude that $\Phi_X$, $\Phi_{DGS}$ and $\Phi_{RGS}$ are subgeometrically ergodic. First, we need convenient characterizations of the operator norms.

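Lemma 2. If $(X, Y) \sim \varpi$, then

$$\|P_X\| = 1 - \inf_{f \in L^2_{0,1}(\varpi_X)} E(\mathrm{Var}(f(X) \mid Y))$$

and

$$\|P_{RGS}\| = 1 - \inf_{f \in L^2_{0,1}(\varpi)} \left\{ p\, E(\mathrm{Var}(f(X, Y) \mid Y)) + (1-p)\, E(\mathrm{Var}(f(X, Y) \mid X)) \right\} .$$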

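Proof. Suppose $X \sim \varpi_X$, $X' \mid X = x \sim P_X(x, \cdot)$ and $(X, Y) \sim \varpi$. Then

$$\begin{aligned}
\|P_X\| &= \sup_{f \in L^2_{0,1}(\varpi_X)} |E[f(X') f(X)]| \\
&= \sup_{f \in L^2_{0,1}(\varpi_X)} \mathrm{Var}(E(f(X) \mid Y)) \\
&= 1 - \inf_{f \in L^2_{0,1}(\varpi_X)} E(\mathrm{Var}(f(X) \mid Y)) .
\end{aligned}$$

In the above, the second equality follows from Liu, Wong and Kong (1994, Lemma 3.2) and the last equality holds since for $f \in L^2_{0,1}(\varpi_X)$

$$1 = E(\mathrm{Var}(f(X) \mid Y)) + \mathrm{Var}(E(f(X) \mid Y)) .$$

Now consider $\|P_{RGS}\|$. Suppose $(X, Y) \sim \varpi$ and $(X', Y') \mid (X, Y) = (x, y) \sim P_{RGS}((x, y), \cdot)$. Then, for $h \in L^2_{0,1}(\varpi)$,

$$\begin{aligned}
E[h(X', Y') h(X, Y)]
&= \int h(x', y') h(x, y)\, k_{RGS}(x', y'|x, y)\, \pi(x, y)\, \mu_X(dx')\, \mu_Y(dy')\, \mu_X(dx)\, \mu_Y(dy) \\
&= \int h(x', y') h(x, y)\, \pi(x, y) \left[ p\, \pi_{X|Y}(x'|y)\, \delta(y' - y) + (1-p)\, \pi_{Y|X}(y'|x)\, \delta(x' - x) \right] \mu_X(dx')\, \mu_Y(dy')\, \mu_X(dx)\, \mu_Y(dy) \\
&= \int p\, h(x', y) h(x, y)\, \pi_{X|Y}(x'|y)\, \pi(x, y)\, \mu_X(dx')\, \mu_X(dx)\, \mu_Y(dy) \\
&\quad + \int (1-p)\, h(x, y') h(x, y)\, \pi_{Y|X}(y'|x)\, \pi(x, y)\, \mu_Y(dy')\, \mu_X(dx)\, \mu_Y(dy) \\
&= \int p\, h(x, y)\, E[h(X', Y) \mid Y = y]\, \pi(x, y)\, \mu_X(dx)\, \mu_Y(dy) \\
&\quad + \int (1-p)\, h(x, y)\, E[h(X, Y') \mid X = x]\, \pi(x, y)\, \mu_X(dx)\, \mu_Y(dy) \\
&= \int p\, h(x, y)\, E[h(X', Y) \mid Y = y]\, \pi_{X|Y}(x|y)\, \pi_Y(y)\, \mu_X(dx)\, \mu_Y(dy) \\
&\quad + \int (1-p)\, h(x, y)\, E[h(X, Y') \mid X = x]\, \pi_{Y|X}(y|x)\, \pi_X(x)\, \mu_X(dx)\, \mu_Y(dy) \\
&= \int p\, E[h(X, Y) \mid Y = y]\, E[h(X', Y) \mid Y = y]\, \pi_Y(y)\, \mu_Y(dy) \\
&\quad + \int (1-p)\, E[h(X, Y) \mid X = x]\, E[h(X, Y') \mid X = x]\, \pi_X(x)\, \mu_X(dx) \\
&= \int p\, (E[h(X, Y) \mid Y = y])^2\, \pi_Y(y)\, \mu_Y(dy) + \int (1-p)\, (E[h(X, Y) \mid X = x])^2\, \pi_X(x)\, \mu_X(dx) \\
&= p\, E\left[ (E[h(X, Y) \mid Y])^2 \right] + (1-p)\, E\left[ (E[h(X, Y) \mid X])^2 \right] .
\end{aligned}$$

Now since $h \in L^2_{0,1}(\varpi)$,

$$\mathrm{Var}(E[h(X, Y) \mid Y]) = E[(E[h(X, Y) \mid Y])^2]$$

and

$$\mathrm{Var}(E[h(X, Y) \mid X]) = E[(E[h(X, Y) \mid X])^2] .$$

Moreover,

$$1 = \mathrm{Var}_\varpi[h(X, Y)] = \mathrm{Var}(E[h(X, Y) \mid Y]) + E(\mathrm{Var}[h(X, Y) \mid Y])$$

and

$$1 = \mathrm{Var}_\varpi[h(X, Y)] = \mathrm{Var}(E[h(X, Y) \mid X]) + E(\mathrm{Var}[h(X, Y) \mid X]) .$$

The result follows easily. □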
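
On a finite state space the first identity of Lemma 2 can be checked numerically. The sketch below rests on standard linear-algebra facts rather than on anything stated in the paper: for a finite joint pmf, the norm of $P_X$ on mean-zero functions is its second-largest eigenvalue, and $\sup \mathrm{Var}(E[f(X) \mid Y])$ over standardized $f$ is the squared second singular value of the matrix $Q(x,y) = \pi(x,y)/\sqrt{\pi_X(x)\pi_Y(y)}$. The function name `px_norm_two_ways` is hypothetical.

```python
import numpy as np


def px_norm_two_ways(joint):
    """Compute ||P_X|| from the spectrum of P_X and as sup Var(E[f(X)|Y])."""
    joint = np.asarray(joint, dtype=float)
    pi_x = joint.sum(axis=1)
    pi_y = joint.sum(axis=0)
    # Marginal chain kernel k_X(x'|x) = sum_y pi_{Y|X}(y|x) pi_{X|Y}(x'|y)
    P_X = (joint / pi_x[:, None]) @ (joint / pi_y).T
    # P_X is similar to Q Q^T, so its eigenvalues are real and lie in [0, 1];
    # the largest is 1 (constants) and the next one is the norm on mean-zero functions.
    eig = np.sort(np.linalg.eigvals(P_X).real)[::-1]
    norm_from_spectrum = eig[1]
    # Squared second singular value of Q gives sup Var(E[f(X)|Y]).
    Q = joint / np.sqrt(np.outer(pi_x, pi_y))
    s = np.linalg.svd(Q, compute_uv=False)   # singular values in decreasing order
    sup_var_cond_mean = s[1] ** 2
    return norm_from_spectrum, sup_var_cond_mean
```

The two returned numbers should agree up to rounding, and $1$ minus either of them is the infimum of $E(\mathrm{Var}(f(X) \mid Y))$ appearing in the lemma.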

Proposition 2. Suppose there exists a sequence $\{h_i \in L^2_{0,1}(\varpi_X)\}$ such that if $(X, Y) \sim \varpi$, then

$$\liminf_{i \to \infty} E[\mathrm{Var}(h_i(X) \mid Y)] = 0 . \tag{8}$$

Then $\|P_X\| = \|P_{RGS}\| = 1$. Hence $\Phi_X$, $\Phi_{RGS}$ and $\Phi_{DGS}$ are subgeometrically ergodic.

Proof. The claim that $\|P_X\| = 1$ follows easily from the first part of Lemma 2. Now consider $\|P_{RGS}\|$. Note that if $f'(x, y) := f(x)$ with $f \in L^2_{0,1}(\varpi_X)$, then $f' \in L^2_{0,1}(\varpi)$. From the second part of Lemma 2 we have

$$\|P_{RGS}\| = 1 - \inf_{f \in L^2_{0,1}(\varpi)} \left\{ p\, E(\mathrm{Var}[f(X, Y) \mid Y]) + (1-p)\, E(\mathrm{Var}[f(X, Y) \mid X]) \right\} .$$

The claim now follows easily since if $f(x, y) = h_i(x)$, then

$$E(\mathrm{Var}[f(X, Y) \mid X]) = E(\mathrm{Var}[h_i(X) \mid X]) = 0$$

and

$$E(\mathrm{Var}[f(X, Y) \mid Y]) = E(\mathrm{Var}[h_i(X) \mid Y]) .$$

Thus we conclude that $\Phi_X$ and $\Phi_{RGS}$ are subgeometrically ergodic. Since $\Phi_X$ and $\Phi_{DGS}$ are either both geometrically ergodic or both subgeometric, it follows that $\Phi_{DGS}$ also converges subgeometrically. □

5. A Discrete Example

We introduce a family of simple discrete distributions which admit usage of the TGS algorithms. We then apply our general results, which will allow us to very nearly characterize the members of the family which admit geometrically ergodic TGS Markov chains.

Let $\{a_i\}_{i=1}^\infty$ and $\{b_i\}_{i=1}^\infty$ be strictly positive sequences satisfying

$$\sum_{i=1}^\infty a_i + \sum_{i=1}^\infty b_i = 1 .$$

Also, let $b_0 = 0$. Let the family consist of the discrete bivariate distributions having density $\pi$ with respect to counting measure on $\mathbb{N} \times \mathbb{N}$ given by

$$\pi(x, y) = \begin{cases} a_x & x = y, \; y = 1, 2, 3, \ldots ; \\ b_y & x = y + 1, \; y = 1, 2, 3, \ldots ; \\ 0 & \text{otherwise} . \end{cases}$$

Hence the marginals are given by

$$\pi_X(x) = \sum_{y=1}^\infty \pi(x, y) = \sum_{y=1}^\infty \left[ a_x I(x = y) + b_y I(y = x - 1) \right] = a_x + b_{x-1}$$

and

$$\pi_Y(y) = \sum_{x=1}^\infty \pi(x, y) = \sum_{x=1}^\infty \left[ a_x I(x = y) + b_y I(x = y + 1) \right] = a_y + b_y .$$

The full conditionals are easily seen to be

$$\pi_{X|Y}(x|y) = \frac{a_y}{a_y + b_y}\, I(x = y) + \frac{b_y}{a_y + b_y}\, I(x = y + 1), \qquad y = 1, 2, 3, \ldots$$

and

$$\pi_{Y|X}(y|x) = \frac{a_x}{a_x + b_{x-1}}\, I(y = x) + \frac{b_{x-1}}{a_x + b_{x-1}}\, I(y = x - 1), \qquad x = 1, 2, 3, \ldots .$$

Define

$$p_x = \frac{a_x b_x}{(a_x + b_{x-1})(a_x + b_x)} \quad \text{and} \quad q_x = \frac{a_{x-1} b_{x-1}}{(a_x + b_{x-1})(a_{x-1} + b_{x-1})} .$$

Then for the DGS

$$k_{DGS}(x', y'|x, y) = \pi_{X|Y}(x'|y)\, \pi_{Y|X}(y'|x')$$

and hence for the marginal chain $\Phi_X$

$$k_X(x'|x) = \sum_{y=1}^\infty \pi_{X|Y}(x'|y)\, \pi_{Y|X}(y|x) = \begin{cases} 1 - p_1 & x' = x = 1 ; \\ p_x & x' = x + 1, \; x \ge 1 ; \\ q_x & x' = x - 1, \; x \ge 2 ; \\ 1 - p_x - q_x & x' = x, \; x \ge 2 ; \text{ and} \\ 0 & \text{otherwise} . \end{cases}$$
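
The marginal chain is thus a birth-death chain on the positive integers. The following sketch computes its up and down probabilities directly from the sequences. The function names `px_qx` and `pi_x` are hypothetical, and the sequences are passed as callables; since only ratios of the $a_i$ and $b_i$ enter, they need not be normalized for this purpose.

```python
def px_qx(a, b, x):
    """Up/down probabilities (p_x, q_x) of the marginal chain at state x >= 1.

    a(i), b(i): callables returning a_i, b_i for i >= 1; b_0 is taken to be 0.
    """
    b_prev = b(x - 1) if x >= 2 else 0.0
    p = a(x) * b(x) / ((a(x) + b_prev) * (a(x) + b(x)))
    if x == 1:
        q = 0.0   # state 1 has no downward move
    else:
        q = a(x - 1) * b_prev / ((a(x) + b_prev) * (a(x - 1) + b_prev))
    return p, q


def pi_x(a, b, x):
    """Marginal mass pi_X(x) = a_x + b_{x-1}, up to the normalizing constant."""
    return a(x) + (b(x - 1) if x >= 2 else 0.0)
```

A useful check is detailed balance, $\pi_X(x)\, p_x = \pi_X(x+1)\, q_{x+1}$; both sides equal $a_x b_x / (a_x + b_x)$, which reflects the symmetry of $P_X$ with respect to its invariant distribution.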
It is easy to see that the kernel $P_X$ is Feller. If $p \in (0, 1)$, then for the random scan Gibbs sampler (RGS) we have

$$k_{RGS}(x', y'|x, y) = p\, \pi_{X|Y}(x'|y)\, \delta(y' - y) + (1 - p)\, \pi_{Y|X}(y'|x)\, \delta(x' - x) .$$

Since for any open set $O$

$$P_{RGS}((x, y), O) = p \sum_{x'=1}^\infty \pi_{X|Y}(x'|y)\, I((x', y) \in O) + (1 - p) \sum_{y'=1}^\infty \pi_{Y|X}(y'|x)\, I((x, y') \in O) ,$$

it is easy to see that $P_{RGS}(\cdot, O)$ is lower semicontinuous and hence $\Phi_{RGS}$ is Feller.

We are now in position to establish sufficient conditions for the geometric ergodicity of $\Phi_X$, $\Phi_{DGS}$ and $\Phi_{RGS}$.

Lemma 3. If

$$\limsup_{x \to \infty} \frac{p_x}{q_x} < 1 \quad \text{and} \quad \liminf_{x \to \infty} q_x > 0 , \tag{9}$$

then $\Phi_X$, $\Phi_{DGS}$ and $\Phi_{RGS}$ are geometrically ergodic.

Proof. We need only verify the conditions of Proposition 1 and we have already seen that both $P_X$ and $P_{RGS}$ are Feller. Set $V(x) = z^x$ for some $z > 1$ which will be determined later and note that

$$G(y) = \sum_{x=1}^\infty V(x)\, \pi_{X|Y}(x|y) = \frac{a_y + z b_y}{a_y + b_y}\, z^y \ge z^y .$$

For any $d > 0$, the sublevel set $A_d := \{x : V(x) \le d\} = \{x : z^x \le d\}$ is bounded. Since $V$ is a continuous function, $A_d$ is also closed, hence compact. Therefore $V$ is unbounded off compact sets on $\mathsf{X}$. On the other hand, for any $d > 0$, the sublevel set $B_d := \{y : G(y) \le d\} \subset \{y : z^y \le d\}$ is bounded. Then for any $c > 0$, $W(x, y) = V(x) + cG(y)$ is unbounded off compact sets on $\mathsf{X} \times \mathsf{Y}$ because for any $d > 0$, $\{(x, y) : W(x, y) \le d\} \subset A_d \times B_{d/c}$ is bounded and closed, hence compact.

Now, all that remains is to construct a drift condition for $V$. Note that for $x \ge 2$,

$$\begin{aligned}
P_X V(x) &= \sum_{x'} z^{x'} k_X(x'|x) \\
&= p_x z^{x+1} + q_x z^{x-1} + (1 - p_x - q_x) z^x \\
&= \left[ z p_x + \frac{q_x}{z} + 1 - p_x - q_x \right] V(x) \\
&= \left[ p_x (z - 1) + q_x \left( \frac{1}{z} - 1 \right) + 1 \right] V(x) .
\end{aligned} \tag{10}$$

We next try to bound the coefficient of $V(x)$ in the right hand side of (10) for all large values of $x$. Set

$$r := \limsup_{x \to \infty} \frac{p_x}{q_x} \quad \text{and} \quad q := \liminf_{x \to \infty} q_x$$

and note that $r < 1$ and $q > 0$ by assumption. Then there exists $x_0 \ge 2$ such that

$$\frac{p_x}{q_x} < \frac{r + 1}{2} \quad \text{and} \quad q_x > \frac{q}{2} \quad \text{for all } x \ge x_0 .$$

For any $z \in (1, 2/(r+1))$ and $x \ge x_0$,

$$\begin{aligned}
p_x (z - 1) + q_x \left( \frac{1}{z} - 1 \right) + 1
&\le \frac{r + 1}{2}\, q_x (z - 1) + q_x\, \frac{1 - z}{z} + 1 \\
&= q_x (z - 1) \left( \frac{r + 1}{2} - \frac{1}{z} \right) + 1 \\
&\le \frac{q}{2} (z - 1) \left( \frac{r + 1}{2} - \frac{1}{z} \right) + 1
\end{aligned}$$

since $z \in (1, 2/(r+1))$ implies

$$\frac{r + 1}{2} - \frac{1}{z} < 0 .$$

Next note that

$$0 < q < 1, \qquad 0 < z - 1 < \frac{1 - r}{1 + r} < 1 \qquad \text{and} \qquad -\frac{1}{2} < \frac{r + 1}{2} - \frac{1}{z} < 0 ,$$

which guarantees

$$0 < \frac{q}{2} (z - 1) \left( \frac{r + 1}{2} - \frac{1}{z} \right) + 1 < 1 .$$

Thus there exists $0 < \beta < 1$ such that

$$\frac{q}{2} (z - 1) \left( \frac{r + 1}{2} - \frac{1}{z} \right) + 1 < \beta < 1 . \tag{11}$$

Finally, to bound $P_X V(x)$ for $x < x_0$, set

$$L := \max_{x < x_0} P_X V(x) . \tag{12}$$

Putting together equations (10) to (12), we have

$$P_X V(x) \le \beta V(x) + L$$

with $0 < \beta < 1$ and $L < \infty$. The conclusion now follows from Proposition 1.

□
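
The coefficient of $V(x) = z^x$ appearing in (10) is easy to evaluate numerically, which can help when searching for a workable $z$. The sketch below is illustrative only: the function name `drift_coefficient` and the numbers $p_x = 0.1$, $q_x = 0.3$ are hypothetical and do not correspond to any example in the paper.

```python
def drift_coefficient(p_x, q_x, z):
    """Coefficient of V(x) = z**x in (10): p_x (z - 1) + q_x (1/z - 1) + 1."""
    return p_x * (z - 1.0) + q_x * (1.0 / z - 1.0) + 1.0


# Hypothetical constant transition probabilities with p_x / q_x = 1/3 < 1.
p_x, q_x = 0.1, 0.3
r = p_x / q_x
z = 1.2                                   # any z in (1, 2 / (r + 1)) = (1, 1.5)
coef = drift_coefficient(p_x, q_x, z)     # about 0.97, which is below 1
```

For these values the one-step expectation $p_x z^{x+1} + q_x z^{x-1} + (1 - p_x - q_x) z^x$ equals `coef * z**x`, so $V$ contracts by a fixed factor at every state where the same bounds on $p_x$ and $q_x$ hold.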



The above sufficient condition for geometric ergodicity involves transition probabilities of the chain $\Phi_X$. Alternatively, we could state a sufficient condition in terms of the probabilities $\{a_i, b_i\}$ which define the density $\pi$.

Define

$$A := \limsup_{i \to \infty} \frac{a_i}{a_{i-1}} ; \qquad m := \liminf_{i \to \infty} \frac{a_i}{b_i} ; \qquad \text{and} \qquad M := \limsup_{i \to \infty} \frac{a_i}{b_i} .$$

Corollary 1. If

$$\limsup_{i \to \infty} \frac{a_i}{b_{i-1}} < \infty , \qquad \limsup_{i \to \infty} \frac{b_i}{a_i} < \infty$$

and $A(1 + M)/(1 + m) < 1$, then $\Phi_X$, $\Phi_{DGS}$ and $\Phi_{RGS}$ are geometrically ergodic.

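Proof. We verify the conditions of Lemma 3. Note that

$$q_i = \frac{b_{i-1}}{a_i + b_{i-1}} \cdot \frac{a_{i-1}}{a_{i-1} + b_{i-1}} = \frac{1}{1 + \frac{a_i}{b_{i-1}}} \cdot \frac{1}{1 + \frac{b_{i-1}}{a_{i-1}}} .$$

Hence

$$\liminf_{i \to \infty} q_i \ge \frac{1}{1 + \limsup_{i \to \infty} \frac{a_i}{b_{i-1}}} \cdot \frac{1}{1 + \limsup_{i \to \infty} \frac{b_i}{a_i}} > 0 .$$

Next observe that

$$\frac{p_i}{q_i} = \frac{a_i b_i}{a_{i-1} b_{i-1}} \cdot \frac{a_{i-1} + b_{i-1}}{a_i + b_i} = \frac{a_i}{a_{i-1}} \cdot \frac{1 + \frac{a_{i-1}}{b_{i-1}}}{1 + \frac{a_i}{b_i}} .$$

Hence

$$\limsup_{i \to \infty} \frac{p_i}{q_i} \le \limsup_{i \to \infty} \frac{a_i}{a_{i-1}} \cdot \frac{1 + \limsup_{i \to \infty} \frac{a_i}{b_i}}{1 + \liminf_{i \to \infty} \frac{a_i}{b_i}} = A\, \frac{1 + M}{1 + m} < 1 .$$

□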



So far in this section, we have used Proposition 1 to get sufficient conditions for 
the geometric ergodicity of the Markov chains. Next, we use Proposition 2 to study 
the conditions under which the Markov chains are subgeometrically ergodic. 

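Lemma 4. The Markov chains $\Phi_X$, $\Phi_{DGS}$ and $\Phi_{RGS}$ are subgeometrically ergodic if any one of the following conditions holds:

1. $\displaystyle \limsup_{i \to \infty} \frac{\sum_{x=i}^\infty (a_x + b_x)}{a_{i-1}} = \infty$ ;

2. $\displaystyle \limsup_{i \to \infty} \frac{\sum_{x=i}^\infty (a_x + b_x)}{b_{i-1}} = \infty$ ; or

3. $\displaystyle \limsup_{i \to \infty} \frac{b_i}{a_i} = \infty$ .
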

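Proof. Let $(X, Y) \sim \varpi$. For $i = 1, 2, 3, \ldots$ let $H_i(x) = I(x \ge i)$. Then

$$\mu_i := E[H_i(X)] = E[H_i^2(X)] = \sum_{x=i}^\infty (a_x + b_{x-1}) < \infty$$

and

$$v_i := \mathrm{Var}[H_i(X)] = \mu_i (1 - \mu_i) < \infty .$$

Define $h_i(x) = [H_i(x) - \mu_i] / \sqrt{v_i}$ and note that $h_i \in L^2_{0,1}(\varpi_X)$. We will show that

$$\liminf_{i \to \infty} E[\mathrm{Var}(h_i(X) \mid Y)] = 0 ,$$

and appeal to Proposition 2 for the conclusion. Let

$$\Delta_y = \frac{b_y}{a_y + b_y} = \pi_{X|Y}(y + 1 \mid y) .$$

Then

$$\begin{aligned}
E[H_i(X) \mid Y = y] &= E[H_i^2(X) \mid Y = y] \\
&= \pi_{X|Y}(y|y)\, H_i(y) + \pi_{X|Y}(y + 1|y)\, H_i(y + 1) \\
&= \begin{cases} 0 & y \le i - 2 , \\ \Delta_{i-1} & y = i - 1 , \\ 1 & y \ge i . \end{cases}
\end{aligned}$$

Hence

$$\mathrm{Var}[H_i(X) \mid Y = y] = \begin{cases} \Delta_{i-1} (1 - \Delta_{i-1}) & y = i - 1 , \\ 0 & \text{otherwise} . \end{cases}$$

Therefore,

$$\begin{aligned}
E(\mathrm{Var}[H_i(X) \mid Y]) &= \sum_{y=1}^\infty \pi_Y(y)\, \mathrm{Var}[H_i(X) \mid Y = y] \\
&= \pi_Y(i - 1)\, \mathrm{Var}[H_i(X) \mid Y = i - 1] \\
&= (a_{i-1} + b_{i-1})\, \Delta_{i-1} (1 - \Delta_{i-1}) .
\end{aligned}$$

Finally,

$$E[\mathrm{Var}(h_i(X) \mid Y)] = v_i^{-1}\, E[\mathrm{Var}(H_i(X) \mid Y)] = [\mu_i (1 - \mu_i)]^{-1}\, \frac{a_{i-1} b_{i-1}}{a_{i-1} + b_{i-1}} .$$

Note that

$$\begin{aligned}
\left\{ E[\mathrm{Var}(h_i(X) \mid Y)] \right\}^{-1}
&= (1 - \mu_i) \left[ \sum_{x=i}^\infty (a_x + b_x) + b_{i-1} \right] \left[ \frac{1}{a_{i-1}} + \frac{1}{b_{i-1}} \right] \\
&= (1 - \mu_i) \left[ \frac{\sum_{x=i}^\infty (a_x + b_x)}{a_{i-1}} + \frac{\sum_{x=i}^\infty (a_x + b_x)}{b_{i-1}} + \frac{b_{i-1}}{a_{i-1}} + 1 \right]
\end{aligned}$$

and that

$$\lim_{i \to \infty} (1 - \mu_i) = \lim_{i \to \infty} \sum_{x=1}^{i-1} (a_x + b_{x-1}) = 1 .$$

Hence equation (8) holds if and only if

$$\limsup_{i \to \infty} \frac{\sum_{x=i}^\infty (a_x + b_x)}{a_{i-1}} = \infty , \quad \text{or} \quad \limsup_{i \to \infty} \frac{\sum_{x=i}^\infty (a_x + b_x)}{b_{i-1}} = \infty , \quad \text{or} \quad \limsup_{i \to \infty} \frac{b_i}{a_i} = \infty .$$

□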
Finally, we can use the previous results to characterize the conditions for geometric ergodicity of TGS Markov chains for a large subfamily of our discrete distributions.

Corollary 2. Assume that both $A := \lim_{i \to \infty} \frac{a_i}{a_{i-1}}$ and $\lim_{i \to \infty} \frac{b_i}{a_i}$ exist. Then all the limits below are well defined and the following statements are equivalent:

(a) $\displaystyle \lim_{i \to \infty} \frac{a_i}{b_{i-1}} < \infty$, $\displaystyle \lim_{i \to \infty} \frac{b_i}{a_i} < \infty$, and $A < 1$.

(b) $\displaystyle \lim_{i \to \infty} \frac{p_i}{q_i} < 1$ and $\displaystyle \lim_{i \to \infty} q_i > 0$.

(c) $\Phi_X$ is geometrically ergodic.

(d) $\Phi_{DGS}$ is geometrically ergodic.

(e) $\Phi_{RGS}$ is geometrically ergodic.

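Proof. As we noted in Section 2.1, the equivalence of (c) and (d) is well known.

(a) $\Rightarrow$ (b): Note that

$$\lim_{i \to \infty} q_i = \frac{1}{1 + \lim_{i \to \infty} \frac{a_i}{b_{i-1}}} \cdot \frac{1}{1 + \lim_{i \to \infty} \frac{b_i}{a_i}} > 0$$

and

$$\lim_{i \to \infty} \frac{p_i}{q_i} = \lim_{i \to \infty} \frac{a_i}{a_{i-1}} \cdot \frac{1 + \lim_{i \to \infty} \frac{a_{i-1}}{b_{i-1}}}{1 + \lim_{i \to \infty} \frac{a_i}{b_i}} = A < 1 .$$

(b) $\Rightarrow$ (c) and (b) $\Rightarrow$ (e): The same argument holds for $\Phi_X$ and $\Phi_{RGS}$. Immediate by Lemma 3.

(c) $\Rightarrow$ (a) and (e) $\Rightarrow$ (a): The same argument holds for $\Phi_X$ and $\Phi_{RGS}$. If the chain is geometrically ergodic, then $\lim_{i \to \infty} \frac{a_i}{b_{i-1}} < \infty$ and $\lim_{i \to \infty} \frac{b_i}{a_i} < \infty$ by conditions 2 and 3 of Lemma 4. Next, if $A = 1$, then for any fixed positive integer $K$, we have

$$\lim_{i \to \infty} \frac{a_{i+1}}{a_i} = 1 , \quad \lim_{i \to \infty} \frac{a_{i+2}}{a_i} = 1 , \quad \ldots , \quad \lim_{i \to \infty} \frac{a_{i+K}}{a_i} = 1 .$$

Then there exists $i_0$ such that for any $i \ge i_0$,

$$\frac{a_{i+1}}{a_i} > \frac{1}{2} , \quad \frac{a_{i+2}}{a_i} > \frac{1}{2} , \quad \ldots , \quad \frac{a_{i+K}}{a_i} > \frac{1}{2} .$$

Hence, given any $K$, there exists $i_0$ such that for any $i > i_0$,

$$\frac{\sum_{x=i}^\infty (a_x + b_x)}{a_{i-1}} > \frac{K}{2} ,$$

which implies

$$\limsup_{i \to \infty} \frac{\sum_{x=i}^\infty (a_x + b_x)}{a_{i-1}} = \infty .$$

Thus by condition 1 of Lemma 4, the chains are subgeometrically ergodic, a contradiction of (c). So $A \ne 1$. But $A$ cannot be greater than 1 either, since otherwise $\sum_{x=1}^\infty a_x = \infty$, which contradicts the fact that $\sum_{x=1}^\infty a_x + \sum_{x=1}^\infty b_x = 1$. Therefore, $A < 1$.

□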



imsart-coll ver. 2011/11/15 file: TanJonesHobert.tex date: March 8, 2013 




To better understand the conditions for geometric ergodicity provided in Corollary 2, we explain its condition (a) explicitly. First, the requirement that $A = \lim_{i \to \infty} \frac{a_i}{a_{i-1}} < 1$ implies that for any $0 < A_1 < A < A_2 < 1$, there exists $i_0$ such that for any $i > i_0$, $a_i / a_{i-1} \in (A_1, A_2)$, hence $a_i \in (a_{i_0} A_1^{i - i_0}, a_{i_0} A_2^{i - i_0})$. In other words, the sequence $\{a_i\}$ decays at a geometric rate as $i$ increases. Secondly, the requirements $\lim_{i \to \infty} \frac{a_i}{b_{i-1}} < \infty$ and $\lim_{i \to \infty} \frac{b_i}{a_i} < \infty$ imply that there exist $0 < B_1, B_2 < \infty$ such that, for any $i \ge i_0$, $a_{i+1} / b_i < B_1$ and $b_i / a_i < B_2$, hence $b_i \in (a_{i+1} B_1^{-1}, a_i B_2) \subset (a_i A_1 B_1^{-1}, a_i B_2)$. That is, $b_i = O(a_i)$ as $i \to \infty$. In summary, condition (a) requires that the sequences $\{a_i\}$ and $\{b_i\}$ both decay geometrically at the same rate as $i$ increases.

We close this section by considering four concrete examples. 

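Example 1. Let $a_x = c_1 x^{-d}$ and $b_x = c_2 x^{-d}$ where $d > 1$ and $(c_1 + c_2) \sum_{x=1}^\infty x^{-d} = 1$. Then both $\lim_{i \to \infty} \frac{a_i}{a_{i-1}}$ and $\lim_{i \to \infty} \frac{b_i}{a_i}$ exist, with $A = 1$. Therefore, $\Phi_X$, $\Phi_{DGS}$ and $\Phi_{RGS}$ are subgeometrically ergodic by Corollary 2.
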

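Example 2. Let $c$ satisfy $(1 + c) e^{-1} / (1 - e^{-1}) = 1$. Set $a_x = c e^{-x}$ and $b_x = e^{-x}$. Then both $\lim_{i \to \infty} \frac{a_i}{a_{i-1}}$ and $\lim_{i \to \infty} \frac{b_i}{a_i}$ exist, with $A = e^{-1} < 1$. Furthermore, $\limsup_{i \to \infty} \frac{a_i}{b_{i-1}} = \lim_{i \to \infty} c e^{-1} < \infty$ and $\limsup_{i \to \infty} \frac{b_i}{a_i} = c^{-1} < \infty$. Therefore, $\Phi_X$, $\Phi_{DGS}$ and $\Phi_{RGS}$ are all geometrically ergodic by Corollary 2.
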
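Example 3. Let $c$ satisfy $c e^{-1} / (1 - e^{-1}) + e^{-2} / (1 - e^{-2}) = 1$. Set $a_x = c e^{-x}$ and $b_x = e^{-2x}$. Then both $\lim_{i \to \infty} \frac{a_i}{a_{i-1}}$ and $\lim_{i \to \infty} \frac{b_i}{a_i}$ exist. Also,

$$\limsup_{i \to \infty} \frac{a_i}{b_{i-1}} = \lim_{i \to \infty} c e^{i - 2} = \infty .$$

Therefore, $\Phi_X$, $\Phi_{DGS}$ and $\Phi_{RGS}$ are subgeometrically ergodic by Corollary 2.
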
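Example 4. Let $c$ satisfy $c e^{-1} / (1 - e^{-1}) + e^{-2} / (1 - e^{-2}) = 1$. Set

$$a_x = \begin{cases} c e^{-x} & x \text{ even} \\ e^{-2x} & x \text{ odd} \end{cases} \qquad \text{and} \qquad b_x = \begin{cases} e^{-2x} & x \text{ even} \\ c e^{-x} & x \text{ odd} . \end{cases}$$

Then $\lim_{i \to \infty} \frac{b_i}{a_i}$ does not exist. Hence Corollary 2 is not applicable. Instead we have to use Lemma 4. Notice that

$$\limsup_{i \to \infty} \frac{b_i}{a_i} \ge \lim_{k \to \infty} \frac{b_{2k+1}}{a_{2k+1}} = \lim_{k \to \infty} \frac{c e^{-(2k+1)}}{e^{-2(2k+1)}} = \lim_{k \to \infty} c e^{2k+1} = \infty$$

and hence $\Phi_X$, $\Phi_{DGS}$ and $\Phi_{RGS}$ are subgeometrically ergodic.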